Picture for Yitao Liu

Yitao Liu

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Add code
May 28, 2026
Viaarxiv icon

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Add code
May 26, 2026
Viaarxiv icon

Security in the Fine-Tuning Lifecycle of Large Language Models: Threats, Defenses,Evaluation, and Future Directions

Add code
May 24, 2026
Viaarxiv icon

Contextual Experience Replay for Self-Improvement of Language Agents

Add code
Jun 07, 2025
Figure 1 for Contextual Experience Replay for Self-Improvement of Language Agents
Figure 2 for Contextual Experience Replay for Self-Improvement of Language Agents
Figure 3 for Contextual Experience Replay for Self-Improvement of Language Agents
Figure 4 for Contextual Experience Replay for Self-Improvement of Language Agents
Viaarxiv icon

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Add code
Jun 26, 2024
Figure 1 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 2 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 3 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 4 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Viaarxiv icon

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Add code
Apr 11, 2024
Figure 1 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 2 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 3 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 4 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Viaarxiv icon

OpenAgents: An Open Platform for Language Agents in the Wild

Add code
Oct 16, 2023
Figure 1 for OpenAgents: An Open Platform for Language Agents in the Wild
Figure 2 for OpenAgents: An Open Platform for Language Agents in the Wild
Figure 3 for OpenAgents: An Open Platform for Language Agents in the Wild
Figure 4 for OpenAgents: An Open Platform for Language Agents in the Wild
Viaarxiv icon

Lemur: Harmonizing Natural Language and Code for Language Agents

Add code
Oct 10, 2023
Viaarxiv icon

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

Add code
Sep 21, 2023
Figure 1 for Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning
Figure 2 for Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning
Figure 3 for Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning
Figure 4 for Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning
Viaarxiv icon

$\mathcal{Y}$-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning

Add code
Feb 20, 2022
Figure 1 for $\mathcal{Y}$-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning
Figure 2 for $\mathcal{Y}$-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning
Figure 3 for $\mathcal{Y}$-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning
Figure 4 for $\mathcal{Y}$-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning
Viaarxiv icon